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DETAILED ACTION 



1. 



This is responsive to the RCE filed on 17 June 2008. 



2. 



Claims 1 -3, 5-7, 10-13, 15-17, and 20 are pending. 



Claim Rejections - 35 USC § 103 



3. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 



4. Claims 1 , 3, 5, 1 1, 13, and 15 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Abe et al (USPN 6,990,443) in view of De Lima Araujo et al 
("Formant Frequency Estimation Using a Mel Scale LPC Algorithm", In Proc. of ITS '98, 
IEEE Intl., Vol. 1, 1998, pp. 207-212). 



Abe discloses a method for classifying an audio signal (Abstract), said method 
comprising: 

calculating a plurality of linear prediction coefficients (LPC) for a portion of the 
audio signal (FIG. 3, item 13 and related text); 

inverse filtering the portion of the audio signal with the plurality of linear 
prediction coefficients (LPC), thereby resulting in a residual signal (FIG. 3, item 20 and 
related text); 



Claim 1: 
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measuring the residual energy of the residual signal (FIG. 3, items 20, 36, and 
related text); and 

comparing the residual energy to a threshold ("extracting the characteristic 
quantity of a signal ... and classifying the signal ... according to the characteristic 
quantity thereof, col. 3, lines 43-51 ). 

Abe does not explicitly disclose decimating the portion of audio, thereby causing 
the portion to comprise a predetermined number of samples. 

In a speech processing method, De Lima Araujo discloses decimating a portion 
of a speech signal (page 209, col. 1 , paragraph 5). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to decimate Abe's audio signal thereby causing the portion to comprise a 
predetermined number of samples in order to reduce fusion errors (De Lima Araujo, 
page 209, col. 2, paragraph 1). 

Claim 3: 

Abe and De Lima Araujo disclose the method of claim 1 , Abe further discloses 
wherein the portion of the audio signal comprises a frame (FIG. 2 and related text). 
Claim 5: 

Abe and De Lima Araujo disclose the method of claim 1, further comprising: 
spectrally flattening the portion of the audio signal (FIG. 3, item 15 and related text). 
Claims 11, 13 and 15: 

System claims 11, 13, and 15 and method claims 1, 3, and 5 are related as 
system and the method of using same, with each claimed element's function 
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corresponding to the claimed method step. Accordingly claims 11, 13, and 1 5 are 
rejected with the same rationale as applied above with respect to method claims 1 , 3, 
and 5. 

5. Claims 2 and 12 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Abe et al (USPN 6,990,443) in view of De Lima Araujo et al ("Formant Frequency 
Estimation Using a Mel Scale LPC Algorithm", In Proc. of ITS '98, IEEE Intl., Vol. 1, 
1 998, pp. 207-21 2) and in further view of Koishida et al (USPN 6,658,383). 
Claim 2: 

Abe and De Lima Araujo disclose the method of claim 1, but they do not explicitly 
disclose classifying the portion of the audio signal as music, if the residual energy 
exceeds the threshold; and classifying the portion of the audio signal as speech, if the 
threshold exceeds the residual energy. 

In a method, similar to Abe's, of audio classification, Koishida teaches that "linear 
prediction-based techniques such as CELP can deliver high quality reproduction for 
speech signals, but yield unacceptable quality for the reproduction of music signals" 
(col. 1, lines 33-37). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to use Abe's residual energy to classify speech and music because as 
Koishida teaches LPC (linear predictive coding) techniques model speech better than 
they do music, therefore giving a smaller error (residual energy) for speech signals 
compared to the error for music signals. 
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Claim 12: 

System claim 12 and method claim 2 are related as system and the method of 
using same, with each claimed element's function corresponding to the claimed method 
step. Accordingly claim 12 is rejected with the same rationale as applied above with 
respect to method claim 2. 

6. Claims 6, 1 0, 1 6, and 20 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Abe et al (USPN 6,990,443) in view of Rabiner, L ("Digital Processing 
of Speech Signals", Prentice-Hall, 1 978) and in further view of De Lima Araujo et al 
("Formant Frequency Estimation Using a Mel Scale LPC Algorithm", In Proc. of ITS '98, 
IEEE Intl., Vol. 1, 1998, pp. 207-212). 
Claim 6: 

Abe discloses a method for classifying an audio signal (Abstract), said method 
comprising: 

taking a discrete Fourier transformation of a portion of the audio signal for a 
plurality of frequencies (FIG. 3, item 12 and related text); 

calculating a plurality of linear prediction coefficients (LPC) for the portion of the 
signal (FIG. 3, item 13 and related text). 

Abe does not explicitly disclose measuring an inverse filter response and a mean 
squared error as claimed. 
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In a textbook on speech processing, Rabiner discloses measuring an inverse 
filter response for said plurality of frequencies with said plurality of linear prediction 
coefficients (LPC) (page 433, equation 8.102 and related text); and 

measuring a mean squared error between the discrete Fourier transformation of 
the portion of the audio signal for the plurality of frequencies and the inverse filter 
response (page 433, equation 8.103b and related text); 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to perform Abe's method in the frequency domain in order to "see peaks at the 
formant frequencies" (Rabiner, page 433, lines 3-5). 

Abe further discloses comparing the means squared error to a threshold 
("extracting the characteristic quantity of a signal ... and classifying the signal ... 
according to the characteristic quantity thereof, col. 3, lines 43-51). 

Abe does not explicitly disclose decimating the portion of audio, thereby causing 
the portion to comprise a predetermined number of samples. 

In a speech processing method, De Lima Araujo discloses decimating a portion 
of a speech signal (page 209, col. 1 , paragraph 5). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to decimate Abe's audio signal thereby causing the portion to comprise a 
predetermined number of samples in order to reduce fusion errors (De Lima Araujo, 
page 209, col. 2, paragraph 1). 

Claim 10: 
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Abe, Rabiner, and De Lima Araujo disclose the method of claim 6, De Lima 
Araujo further discloses: Spectrally flattening (pre-emphasis) the portion of the audio 
signal (page 209, col. 2, paragraph 2). 

Claims 16 and 20: 

System claims 16 and 20 and method claims 6 and 10 are related as system and 
the method of using same, with each claimed element's function corresponding to the 
claimed method step. Accordingly claims 16 and 20 are rejected with the same rationale 
as applied above with respect to method claims 6 and 10. 

7. Claims 7 and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Abe et al (USPN 6,990,443) in view of Rabiner, L ("Digital Processing of Speech 
Signals", Prentice-Hall, 1978), in view of De Lima Araujo et al ("Formant Frequency 
Estimation Using a Mel Scale LPC Algorithm", In Proc. of ITS '98, IEEE Intl., Vol. 1, 
1 998, pp. 207-21 2), and in further view of Koishida et al (USPN 6,658,383). 
Claim 7: 

Abe, Rabiner, and De Lima Araujo disclose the method of claim 8, but they do 
not explicitly disclose: classifying the portion of the audio signal as music, if the mean 
squared error exceeds the threshold; and classifying the portion of the audio signal as 
speech, if the threshold exceeds the means squared error energy. 

In a method, similar to Abe's, of audio classification, Koishida teaches that "linear 
prediction-based techniques such as CELP can deliver high quality reproduction for 
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speech signals, but yield unacceptable quality for the reproduction of music signals" 
(col. 1, lines 33-37). 

It would have been obvious to one with ordinary skill in the art at the time of the 
invention to use Abe and Rabiner's error to classify speech and music because as 
Koishida teaches LPC (linear predictive coding) techniques model speech better than 
they do music, therefore giving smaller error for speech signals compared to the error 
for music signals. 

Claim 17: 

System claim 17 and method claim 7 are related as system and the method of 
using same, with each claimed element's function corresponding to the claimed method 
step. Accordingly claim 17 is rejected with the same rationale as applied above with 
respect to method claim 7. 

Conclusion 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Samuel G. Neway whose telephone number is 571-270- 
1058. The examiner can normally be reached on Monday - Friday 8:30AM - 5:30PM 
EST. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David R Hudspeth can be reached on 571-272-7843. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 

IS. G. N./ 

Examiner, Art Unit 2626 



